Probabilistic Risk Assessment for Resource Provision in Grid

Authors

  • Raid Alsoghayer
  • Karim Djemame
Abstract

Service Level Agreements (SLAs) were introduced to overcome the shortcomings of the best-effort approach in Grid computing and to make Grid computing more attractive for commercial use. Yet commercial Grid providers are reluctant to adopt SLAs, since an SLA violation results in a penalty fee. This paper analyses failure data collected from three different Grid sites. We study the statistics of the data, including the root cause of failures, the mean time to repair and the time between failures. We find that software and hardware failures are the largest contributors, and that the time to repair varies with the root cause, from 13 hours for network errors to around 46 hours for unknown errors. We also find that the repair time is well modelled by a Weibull distribution. From the analysis of the historical data we find that the time between failures in a Grid system is well modelled by a Weibull distribution with a decreasing hazard rate, and that a resource provider can use this to predict the risk of failure.
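To illustrate what a Weibull model with a decreasing hazard rate implies for a resource provider, the following minimal Python sketch computes the hazard rate and the probability that a resource fails within a given window, given that it has already run without failure for some time. The function names and the shape and scale values are hypothetical and chosen only for illustration; they are not the parameters fitted in the paper. A shape parameter below 1 is what produces the decreasing hazard rate.

```python
import math


def weibull_hazard(t, shape, scale):
    """Weibull hazard rate h(t) = (k / lam) * (t / lam)**(k - 1), for t > 0."""
    return (shape / scale) * (t / scale) ** (shape - 1)


def weibull_survival(t, shape, scale):
    """Probability that no failure has occurred by time t (t >= 0)."""
    return math.exp(-((t / scale) ** shape))


def conditional_failure_probability(age, window, shape, scale):
    """P(failure in (age, age + window] | no failure up to age)."""
    survive_age = weibull_survival(age, shape, scale)
    survive_end = weibull_survival(age + window, shape, scale)
    return 1.0 - survive_end / survive_age


if __name__ == "__main__":
    # Hypothetical parameters (NOT from the paper): shape < 1, scale in hours.
    shape, scale = 0.7, 100.0
    for age in (0.0, 50.0, 200.0):
        risk = conditional_failure_probability(age, 24.0, shape, scale)
        print(f"uptime {age:6.1f} h -> risk of failure in next 24 h: {risk:.3f}")
```

Under these assumptions, the longer a resource has been running without failure, the lower its estimated risk of failing during the next job window, which is the kind of estimate a provider could weigh against an SLA penalty.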


Similar resources

Predictive Probabilistic and Predictive Possibilistic Models for Risk Assessment in Grid Computing

We show a hybrid probabilistic and possibilistic model for assessing the risk of a service level agreement for a computing task in a cluster/grid environment. Using the predictive probabilistic approach we develop a framework for resource management in grid computing, and by introducing an upper limit for the number of failures we approximate the probability that a particular computing task is ...


Stability Assessment Metamorphic Approach (SAMA) for Effective Scheduling based on Fault Tolerance in Computational Grid

Grid Computing allows coordinated and controlled resource sharing and problem solving in multi-institutional, dynamic virtual organizations. Moreover, fault tolerance and task scheduling are important issues for large-scale computational grids because of the unreliable nature of grid resources. A commonly exploited technique to realize fault tolerance is periodic checkpointing that periodically ...


Exposure Assessment of Total Mercury: A Probabilistic-Approach Study Based on Consumption of Canned Fish

Introduction: Exposure to mercury (Hg) by consumption of fish is a recent health concern. So, it is important to evaluate the health risks related to canned fish consumption. The purpose of this study was to investigate the potential health risk based on Hg concentration in people who consumed canned fish with a probabilistic approach in Isfahan City, the central province in Iran. Materials an...


Applicable risk assessment methods in occupational and environmental exposure to nanoparticles - a narrative review

Nanoparticles (NPs) are a heterogeneous group of materials that have various applications, and their risk assessment is an essential condition. This study aimed to review the applicable risk assessment methods in occupational and environmental exposures to NPs. A literature search for articles published since 2005 in Web of Knowledge, Scopus, PubMed, Science Direct, and Google Scholar, using ap...


An Optimization Model for Financial Resource Allocation Towards Seismic Risk Reduction

This paper presents a study on determining the degree of effectiveness of earthquake risk mitigation measures and how to prioritize such efforts in developing countries. In this paper a model is proposed for optimizing the allocation of funds towards risk reduction measures (building retrofitting) and the reconstruction process after potential earthquakes at a regional level. The proposed model seeks opti...




Publication date: 2009